Add `cupy` backend for GPU support #1048

s3alfisc · 2025-10-23T18:55:06Z

We add a cupy backend https://docs.cupy.dev/en/latest/index.html to run FWL on the GPU.

Strategy: "direct" FWL OLS residualization instead of iterative solver.

Should help with problems with slow convergence.

Only tested on CPU via scipy, where results are matching. Cannot test on GPU as no Mac / no access to Cuda GPU.

Note: we can build a backend using a similar strategy via JAX and torch.

import pyfixest as pf

data = pf.get_data()

fit_rust = pf.feols("Y ~ X1 + X2  | f1", data = data, demeaner_backend="rust")
fit_cupy = pf.feols("Y ~ X1 + X2  | f1", data = data, demeaner_backend="cupy")

(pf.etable([fit_rust, fit_cupy], type = "md", digits = 8))
index                    est1             est2
------------  ---------------  ---------------
depvar                      Y                Y
----------------------------------------------
X1            -0.94952557***   -0.94952557***
                 (0.06637281)     (0.06637281)
X2            -0.17422527***   -0.17422527***
                 (0.01759569)     (0.01759569)
f1                          x                x
----------------------------------------------
Observations              997              997
----------------------------------------------
S.E. type                 iid              iid
R2                 0.48899391       0.48899391
R2 Within          0.23875790       0.23875790
----------------------------------------------

codecov · 2025-10-23T19:05:52Z

Codecov Report

❌ Patch coverage is 23.94366% with 108 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
pyfixest/estimation/cupy/demean_cupy_.py	21.13%	97 Missing ⚠️
pyfixest/estimation/demean_.py	0.00%	6 Missing ⚠️
pyfixest/estimation/backends.py	54.54%	5 Missing ⚠️

❗ There is a different number of reports uploaded between BASE (70a154c) and HEAD (6388b11). Click for more details.

HEAD has 2 uploads less than BASE

Flag BASE (70a154c) HEAD (6388b11)

core-tests 2 0

Flag	Coverage Δ
core-tests	`?`
tests-vs-r	`15.82% <23.94%> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
pyfixest/estimation/estimation.py	`9.19% <ø> (-81.61%)`	⬇️
pyfixest/estimation/literals.py	`68.75% <100.00%> (-18.75%)`	⬇️
pyfixest/estimation/vcov_utils.py	`19.01% <100.00%> (-20.71%)`	⬇️
pyfixest/estimation/backends.py	`72.41% <54.54%> (-10.92%)`	⬇️
pyfixest/estimation/demean_.py	`12.09% <0.00%> (-41.30%)`	⬇️
pyfixest/estimation/cupy/demean_cupy_.py	`21.13% <21.13%> (ø)`

... and 38 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

s3alfisc · 2025-10-23T19:07:42Z

TODO

add tests
create GPU env
maybe fix numpy to pandas to sparse array conversion? could lead to poor performance

review-notebook-app · 2025-10-31T20:52:37Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

s3alfisc · 2025-11-02T11:14:45Z

pre-commit.ci autofix

for more information, see https://pre-commit.ci

# Please enter a commit message to explain why this merge is necessary, # especially if it merges an updated upstream into a topic branch. # # Lines starting with '#' will be ignored, and an empty message aborts # the commit.

initial commit

8a1d757

s3alfisc linked an issue Oct 23, 2025 that may be closed by this pull request

pyfixest is MUCH slower than (Stata + Julia) reghdfejl #1042

Open

s3alfisc mentioned this pull request Oct 23, 2025

pyfixest is MUCH slower than (Stata + Julia) reghdfejl #1042

Open

s3alfisc added 15 commits October 23, 2025 21:35

tweaks

b8a4130

some gpu fixes

434cedb

32 vs 64 option

05a72b3

clarification

0a4c9e6

zero copy df

56a4cdf

np.ndarray

5fb0b6a

data conversino

508a5fa

delete

7bab197

tweaks

9bdbb28

update tests

65b18a5

pendantic benchmark

0b3008c

tests

bc06821

pass cupy tests

5dbb354

dedicated scipy backend

b1aee99

add other demeaners to backends

c3f36b1

s3alfisc added 9 commits October 31, 2025 23:03

update timings@

3be6682

nb to script

87f2e27

nb to script

d90639a

bench updates

22a077b

update benchmark script

0be7b06

update docs

013a3ac

update docs

5b5370e

more docs upadtes

10def15

update lock file

e8915d0

pre-commit-ci bot and others added 15 commits November 2, 2025 11:15

[pre-commit.ci] auto fixes from pre-commit.com hooks

07b2c62

for more information, see https://pre-commit.ci

ci update

093854b

adjust demean test strategy

29ceffe

delete dev env

2db6f53

fail fasterÄ

d854fb0

bring back bench env

11836ec

move hac tests for faster ci

f3a0d2e

Merge master into cupy branch

79ef46a

internal sort

c60549a

use rng

216e00d

merge master to cupy

6b98b73

bring back jit

09f21e2

update docs

9b7a575

docstrings

6388b11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `cupy` backend for GPU support #1048

Add `cupy` backend for GPU support #1048

Uh oh!

s3alfisc commented Oct 23, 2025

Uh oh!

codecov bot commented Oct 23, 2025 •

edited

Loading

Uh oh!

s3alfisc commented Oct 23, 2025 •

edited

Loading

Uh oh!

review-notebook-app bot commented Oct 31, 2025

Uh oh!

s3alfisc commented Nov 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add cupy backend for GPU support #1048

Are you sure you want to change the base?

Add cupy backend for GPU support #1048

Uh oh!

Conversation

s3alfisc commented Oct 23, 2025

Uh oh!

codecov bot commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

s3alfisc commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Oct 31, 2025

Uh oh!

s3alfisc commented Nov 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add `cupy` backend for GPU support #1048

Add `cupy` backend for GPU support #1048

codecov bot commented Oct 23, 2025 •

edited

Loading

s3alfisc commented Oct 23, 2025 •

edited

Loading